Goto

Collaborating Authors

 Gaziantep


HInter: Exposing Hidden Intersectional Bias in Large Language Models

arXiv.org Artificial Intelligence

Large Language Models (LLMs) may portray discrimination towards certain individuals, especially those characterized by multiple attributes (aka intersectional bias). Discovering intersectional bias in LLMs is challenging, as it involves complex inputs on multiple attributes (e.g. race and gender). To address this challenge, we propose HInter, a test technique that synergistically combines mutation analysis, dependency parsing and metamorphic oracles to automatically detect intersectional bias in LLMs. HInter generates test inputs by systematically mutating sentences using multiple mutations, validates inputs via a dependency invariant and detects biases by checking the LLM response on the original and mutated sentences. We evaluate HInter using six LLM architectures and 18 LLM models (GPT3.5, Llama2, BERT, etc) and find that 14.61% of the inputs generated by HInter expose intersectional bias. Results also show that our dependency invariant reduces false positives (incorrect test inputs) by an order of magnitude. Finally, we observed that 16.62% of intersectional bias errors are hidden, meaning that their corresponding atomic cases do not trigger biases. Overall, this work emphasize the importance of testing LLMs for intersectional bias.


Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM

arXiv.org Artificial Intelligence

Humans excel at forming mental maps of their surroundings, equipping them to understand object relationships and navigate based on language queries. Our previous work SI Maps [1] showed that having instance-level information and the semantic understanding of an environment helps significantly improve performance for language-guided tasks. We extend this instance-level approach to 3D while increasing the pipeline's robustness and improving quantitative and qualitative results. Our method leverages foundational models for object recognition, image segmentation, and feature extraction. We propose a representation that results in a 3D point cloud map with instance-level embeddings, which bring in the semantic understanding that natural language commands can query. Quantitatively, the work improves upon the success rate of language-guided tasks. At the same time, we qualitatively observe the ability to identify instances more clearly and leverage the foundational models and language and image-aligned embeddings to identify objects that, otherwise, a closed-set approach wouldn't be able to identify.


Comparison of Machine Learning Methods for Assigning Software Issues to Team Members

arXiv.org Artificial Intelligence

Software issues contain units of work to fix, improve, or create new threads during the development and facilitate communication among the team members. Assigning an issue to the most relevant team member and determining a category of an issue is a tedious and challenging task. Wrong classifications cause delays and rework in the project and trouble among the team members. This paper proposes a set of carefully curated linguistic features for shallow machine learning methods and compares the performance of shallow and ensemble methods with deep language models. Unlike the state-of-the-art, we assign issues to four roles (designer, developer, tester, and leader) rather than to specific individuals or teams to contribute to the generality of our solution. We also consider the level of experience of the developers to reflect the industrial practices in our solution formulation. We collect and annotate five industrial data sets from one of the top three global television producers to evaluate our proposal and compare it with deep language models. Our data sets contain 5324 issues in total. We show that an ensemble classifier of shallow techniques achieves 0.92 for issue assignment in accuracy which is statistically comparable to the state-of-the-art deep language models. The contributions include the public sharing of five annotated industrial issue data sets, the development of a clear and comprehensive feature set, the introduction of a novel label set, and the validation of the efficacy of an ensemble classifier of shallow machine learning techniques.


Instance-Level Semantic Maps for Vision Language Navigation

arXiv.org Artificial Intelligence

Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic instructions. A natural goal in Vision Language Navigation (VLN) research is to impart autonomous agents with similar capabilities. Recent works take a step towards this goal by creating a semantic spatial map representation of the environment without any labeled data. However, their representations are limited for practical applicability as they do not distinguish between different instances of the same object. In this work, we address this limitation by integrating instance-level information into spatial map representation using a community detection algorithm and utilizing word ontology learned by large language models (LLMs) to perform open-set semantic associations in the mapping representation. The resulting map representation improves the navigation performance by two-fold (233%) on realistic language commands with instance-specific descriptions compared to the baseline. We validate the practicality and effectiveness of our approach through extensive qualitative and quantitative experiments.


LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction

arXiv.org Artificial Intelligence

Instruction tuning enables language models to generalize more effectively and better follow user intent. However, obtaining instruction data can be costly and challenging. Prior works employ methods such as expensive human annotation, crowd-sourced datasets with alignment issues, or generating noisy examples via LLMs. We introduce the LongForm dataset, which is created by leveraging English corpus examples with augmented instructions. We select a diverse set of human-written documents from existing corpora such as C4 and Wikipedia and generate instructions for the given documents via LLMs. This approach provides a cheaper and cleaner instruction-tuning dataset and one suitable for long text generation. We finetune T5, OPT, and LLaMA models on our dataset and show that even smaller LongForm models have good generalization capabilities for text generation. Our models outperform 10x larger language models without instruction tuning on various tasks such as story/recipe generation and long-form question answering. Moreover, LongForm models outperform prior instruction-tuned models such as FLAN-T5 and Alpaca by a large margin. Finally, our models can effectively follow and answer multilingual instructions; we demonstrate this for news generation. We publicly release our data and models: https://github.com/akoksal/LongForm.


Why is Britain experiencing so many earthquakes? Experts weigh in

Daily Mail - Science & tech

From Cornwall and Wales to Essex, Blackpool and the Norfolk coast, Britain has experienced a flurry of earthquakes in the past month. The biggest – a 3.8 magnitude tremor that struck Wales on February 24 – sparked panic as locals reported their beds started to move and walls shook. One resident in the small Welsh town of Abertillery not far from the epicentre said the quake was so noticeable'it felt like the roof was falling off'. The Welsh quake was preceded by several more including a 1.5 magnitude quake in Cornwall and a 3.8 magnitude event off the coast of Great Yarmouth. Here's all you need to know about the British tremors – including whether recent tectonic activity suggests a'big one' is soon to hit parts of the country.


Tweets Under the Rubble: Detection of Messages Calling for Help in Earthquake Disaster

arXiv.org Artificial Intelligence

The importance of social media is again exposed in the recent tragedy of the 2023 Turkey and Syria earthquake. Many victims who were trapped under the rubble called for help by posting messages in Twitter. We present an interactive tool to provide situational awareness for missing and trapped people, and disaster relief for rescue and donation efforts. The system (i) collects tweets, (ii) classifies the ones calling for help, (iii) extracts important entity tags, and (iv) visualizes them in an interactive map screen. Our initial experiments show that the performance in terms of the F1 score is up to 98.30 for tweet classification, and 84.32 for entity extraction. The demonstration, dataset, and other related files can be accessed at https://github.com/avaapm/deprem


Why are there so many earthquakes?

Daily Mail - Science & tech

Less than two weeks after the tragic earthquake that has killed more than 40,000 people in Turkey and Syria, another shake has rocked New Zealand. Wednesday's'widely felt' tremor, around magnitude 6, jolted both New Zealand's islands, although thankfully there's been no immediate reports of damage or injury. Earthquakes are happening all the time, from the ones too small to even be noticed to the devastating high magnitude quakes that lead to thousands of fatalities. But its occurrence so soon after the disaster in Turkey and Syria begs the question - could they be linked? Here, MailOnline takes a closer look at today's event and whether it's related to the catastrophic tremor in the Middle East last week.


Artificial intelligence mixes into production lines

#artificialintelligence

Interruptions in the supply chain during the coronavirus pandemic and problems in logistics that caused constant deviations from forecasts created significant problems for production-based economies. Production and logistics problems experienced at factories in China prompted Europe to turn to Turkey. When China and the U.S. moved containers to their own countries, Turkey started transporting them to Europe with trucks. At this stage, uninterrupted production necessitated technology-oriented transformation. This week, a Ventures60 event under the title "The Age of Uninterrupted Production" addressed a series of topics – from corporate intelligence solutions in the production of unmanned aerial vehicles used in Turkey's largest refinery, Tüpraş, to corporate investors investing in production-oriented artificial intelligence (AI) and cyberattack threats.


An Estimation of Personnel Food Demand Quantity for Businesses by Using Artificial Neural Networks

arXiv.org Machine Learning

Today, many public or private institutions provide professional food service for personnels working in their own organizations. Regarding the planning of the said service, there are some obstacles due to the fact that the number of the personnel working in the institutions is generally high and the personnel are out of the institution due to personal or institutional reasons. Because of this, it is difficult to determine the daily food demand, and this causes cost, time and labor loss for the institutions. Statistical or heuristic methods are used to remove or at least minimize these losses. In this study, an artificial intelligence model was proposed, which estimates the daily food demand quantity using artificial neural networks for businesses. The data are obtained from a refectory database of a private institution with a capacity of 110 people serving daily meals and serving at different levels, covering the last two years (2016-2018). The model was created using the MATLAB package program. The performance of the model was determinde by the Regression values, the Mean Absolute Percentage Error (MAPE) and the Mean Squared Error (MSE). In the training of the ANN model, feed forward back propagation network architecture is used. The best model obtained as a result of the experiments is a multi-layer (8-10-10-1) structure with a training R ratio of 0,9948, a testing R ratio of 0,9830 and an error rate of 0,003783, respectively. Experimental results demonstrated that the model has low error rate, high performance and positive effect of using artificial neural networks for demand estimating.